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Optimized Neural Networks to Search for Higgs Boson Production at the 
Tevatron. 

E. Boos, L. Dudko ^ 

'^Scobeltsyn Institute of Nuclear Physics of Moscow State University, 
Vorobyevy Gory, 119899, Moscow, Russia 

An optimal choice of proper kinematical variables is one of the main steps in using neural networks (NN) in high 
energy physics. Our method of the variable selection is based on the analysis of a structure of Feynman diagrams 
(singularities and spin correlations) contributing to the signal and background processes. An application of this 
method to the Higgs boson search at the Tevatron leads to an improvement in the NN efficiency by a factor of 
1.5-2 in comparison to previous NN studies. 



1. The basic idea 

In High Energy physics a discrimination be- 
tween a signal and its corresponding backgrounds 
by Neural Networks (NN) is especially remarkable 
when the data statistics are limited. In this case 
it is important to optimize all steps of the anal- 
ysis. One of the main questions which arises in 
the use of NNs is which, and how many variables 
should be chosen for network training in order 
to extract a signal from the backgrounds in an 
optimal way. The general problem is rather com- 
plicated and finding a solution depends on having 
a concrete process for making the choice, because 
usually it takes a lot of time to compare results 
from different sets of variables. 

One observation which helps in making the best 
choice of the most sensitive variables is to study 
the singularities in Feynman diagrams of the pro- 
cesses. Let us call those kinematic variables in 
which singularities occur as "singular variables". 
What is important to stress here is that most of 
the rates for both the signal and for the back- 
grounds come from the integration over the phase 
space region close to these singularities. One 
can compare the lists of singular variables and 
the positions of the corresponding singularities in 
Feynman diagrams for the signal process and for 
the backgrounds. It is obvious that if some of 
the singular variables are different or the posi- 
tions of the singularities are different for the same 



variable for the signal and for the backgrounds 
the corresponding distributions will differ most 
strongly. Therefore, if one uses all such singu- 
lar variables in the analysis, then the largest part 
of the phase space where the signal and back- 
grounds differ most will be taken into account. 
One might think that it is not a simple task to 
list all the singular variables when the phase space 
is very complex, for instance, for reactions with 
many particles involved. However, in general, all 
singular variables can be of only two types, either 
s-channel: M|-^ j2 = (p/i +p/2)^, where pfi and 
Pf2 are the four momenta of the final particles fl 
and /2 or t-channel: Uj = {pf — Pif' where p/ 
and Pi are the momenta of the final particle (or 
cluster) and the initial parton. For the tij all the 
needed variables can be easily found in massless 
case: iij = — Vse^p;^e~l^-f where s is the to- 
tal invariant mass of the produced system, and 
Y is the rapidity of the total system (rapidity of 
the center mass of the colliding partons), ptp and 
ijf are transverse momenta and pseudorapidity of 
the final particle /. The idea of using singular 
variables as the most discriminative ones is de- 
scribed in ^ and the corresponding method was 
demonstrated in practice in 

Singular variables correspond to the struc- 
ture of the denominators of Feynman diagrams. 
Another type of interesting variables corresponds 
to the numerators of Feynman diagrams and re- 
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fleets the spin efl^eets and the corresponding dif- 
ference in angular distributions of the final par- 
ticles. In order to discriminate between a signal 
and the backgrounds, one should choose in addi- 
tion to singular variables mentioned above those 
angular variables whose distributions are differ- 
ent for the signal and backgrounds. The set of 
these singular and angular variables will be the 
most efficient set for a NN analysis. 

The third type of useful variables which we 
called "Threshold" variables are related to the 
fact that various signal and background processes 
may have very different thresholds. Therefore 
the distributions over such kind of variables also 
could be very different keeping in mind that ef- 
fective parton luminosities depend strongly on s. 
The variable s would be a very efficient variable 
of that kind. However, the problem is that in 
case of neutrinos in the final state one can not 
measure s and should use the effective s which is 
reconstructed by solving t-,W-mass equations for 
the neutrino longitudinal momenta. That is why 
we propose to use not only the effective variable 
s but the variable Hi^*'' as well. 

To apply the method it is important to use 
a proper Monte-Carlo model of signal and back- 
ground events which includes all needed spin cor- 
relations between production and decays. For the 
following analysis we have calculated the com- 
plete tree level matrix elements for the back- 
ground processes with all decays and correlations 
by means of the CompHEP program The 
corresponding events are available at the FNAL 
Monte-Carlo events database 

2. Applying the method 

The present estimation of the expected sensi- 
tivities for the light Higgs boson search at the 
Tevatron by means of NNs is given in |H] . Based 
on the method described above we improve the 
efficiency of the NN technique. In the analysis 
we choose the Higgs boson mass to be Mh = 
115 GeV. We model the detector smearing by the 
SHW package 0. 

First of all we exclude ineffective variables from 
the old set , like from the PF-boson (shown 
at the left plot in Fig.QJ. After the correspond- 



ing analysis of Feynman diagrams and compar- 
ison of kinematical distributions we added the 
new variables for NN training. The example dis- 
tribution for the new variable {cos{zaxis, s)) is 
shown in the right plot of Fig. ^ At the next 
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Figure 1. Examples of the old kinematic variable 
(left plot) and the new one (right plot) 



step we constructed the set of NNs for pairs of 
the signal (WH) and each of the background 
from the complete set of principle backgrounds 
{Wbb, WZ, tt, tb{j)). 

The standard steps of NN training were used 
for the NNs with the old set of input variables and 
with the new one. Efficiencies of networks with 
different sets have been compared based on the 
criteria that for the better net the "Error func- 
tion" E — J^iLii'^i ~ Oj)^, where di and Oi are 
the desired and real outputs of the net and A'' is 
the number of test events, is smaller. Two ex- 
amples of distributions you can see in Fig [21 for 
the WH - tt network (left plot) and WH - WZ 
network (right plot). One can see a significant im- 
provement for the networks with new input sets 
in comparison with old sets of variables, since the 
corresponding curves of the error function are sig- 
nificantly lower. 

3. Results 

Based on the described method we have con- 
structed the new NNs to search for a light Higgs 
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boson at the Tevatron. After checking the im- 
provement in efhciency of new networks we rec- 
ommend the new sets of input variables for NNs, 
which are shown below: 

• Wbb ~ WH 

NN: A%, s, 
cos{bl, b2)\iab, cos{bl, 6162)|f,ih2 

• WZ - WH 

NN: M,-„ P^\ H^''\ 

cos{bl,b2)\iab, Qxcos{z,bl)\iab, cos{W,e)\w 

• tt- WH 

NN: Mft^, Mwb, s, Mwjets-b, H^t^\ 
Q X cos{2lJaxis,e)\top, cos{bl, blb2)\bib2 

• tbj, tb - WH 

NN: Mfcs, Mwb, P^^ s, 

Mwjets-b, Pt^, H'r^*\ cos{z,e)\iab, 

Q y. cos{z,bl)\top, cos{e,j)\top 

where there are three types of variables: 

• "Singular" variables (denominator of Feyn- 
man diagrams): 

Mi2 is the invariant mass of two particles 
and/or jets (1 and 2) and corresponds to 
s-channel singularities; 

(the transverse momenta of f) ; 
Mwjets-b is the invariant mass of the W 
and all jets except the 6-jet for which the 
Mt — {pw + Pb) is closest to the top quark 
mass; 

• "Angular" variables (numerator of 
Feynman diagrams, spin effects): 
cos(&l, 6162)|bib2 means the cosine of the 
angle between highest Pt b-quark and vec- 
tor sum of the two highest Pt b-quarks 
in the rest frame of these two b-quarks. 
Scalar (Higgs) and vector (gluon, Z-boson) 
particle decays lead to significantly differ- 
ent distributions on this variable, this is 



also very much different for the case when 
b-quarks come from the decay of top and 
anti-top quarks; 

cos{bl,b2)\iab characterizes how much two 
b-quarks are coUinear; 
cos{z,bl)\iab and cos{W,e)\w reflect the 
difference in t-channel Z-boson and s- 
channel Higgs-boson production topologies 
where lab means the laboratory rest frame 
and z means the z-axis; 
cos{il)axis,e)\top and cos{e,j)\top |B] are 
the top quark spin correlation variables 
used in the analysis of the top quark pair 
and single production, the Icpton charge Q 
is added here to take uniformly into account 
the electron and the positron contributions 
from the VK-boson decays. 

• "Threshold" variables. As explained above 
the s and Hi^*"^ variables are used in our 
analysis. 

As one can see from the Fig|21using the new NN 
variables allows to improve the NN efficiency by a 
factor of 1.5-2 depending on the background pro- 
cess. It will lead to corresponding improvement 
in prospects to find a light Higgs at the Tevatron. 
However, one needs to take into account the ZH 
production channel as well as a number of de- 
tector efficiencies in order to predict a realistic 
discovery limit. 
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